1. INTRODUCTION

Prostate cancer has become one of the most dangerous diseases affecting men's health, and detecting
it at an early stage makes it much easier to treat. Traditional methods for diagnosing prostate cancer
depend on individual biomarkers and on testing tissue and cell specimens. Genetics and age are factors
that help in diagnosing prostate cancer, and they are the first data that must be available to assess the
disease. Prostate-Specific Antigen (PSA) is the protein biomarker usually used to detect prostate cancer
in patients [1]. The PSA level is low (less than 2 ng/mL) in the blood of healthy men and rises above
2 ng/mL in prostate cancer patients [2]. Although PSA is an important indicator of prostate cancer,
other factors such as age and genetics should also be taken into account in the diagnosis [3].

Several studies have addressed the early-stage diagnosis of prostate cancer using more recent and
effective methods. Kallen et al. [4] used a Convolutional Neural Network (CNN) based approach, applying
a pre-trained CNN for feature extraction and a Support Vector Machine (SVM) for classification. For
automated classification of Gleason grading, Gorelik et al. [5] segmented microscopic images into
pathologically meaningful segments such as lumen, stroma and lymphocytes. Doyle et al. [6] proposed a
Bayesian multi-resolution method for classification using AdaBoost. Torsten et al. [7] used
classification by learning vector quantization, in which neural network training was adapted to
different sample sizes per class. More recently, Litjens et al. [8] used a CNN to classify hematoxylin
and eosin images of prostatic tissue.

Wavelet networks were first introduced by Zhang and Benveniste [9] as an alternative to
Feedforward Neural Networks (FFNN) that addresses the drawbacks of both NNs and Wavelet Analysis (WA)
while combining the strengths of the two approaches [10]. Chen and Feng [10] used a Wavelet Neural
Network (WNN) for system modelling and time series prediction based on multiresolution learning. Their
results indicated that the WNN has a powerful approximation capability and is well suited to prediction
and modelling. Therefore, the WNN is a significant tool in classification and signal processing [11]-[12].

In this article, a new approach for classifying prostate cancer risk is proposed using a WNN.
The WNN method is simulated to obtain highly accurate classification results, and the simulation result
of the new approach is compared with that of a traditional classification approach, the Neural Network
(NN). This article is organized as follows. Part 2 introduces the data collection. Parts 3 and 4 present
the WNN and its configuration, with a description of the applied learning algorithm. Part 5 summarizes
the simulation results of both NN and WNN. Finally, Part 6 provides the conclusion of this paper.


2. DATA COLLECTION 

Finding and collecting data is very important for training and testing the WNN and the NN. In this
work, the data are divided into three features: age, PSA and prostate volume. Regarding age, the risk
of prostate cancer is highest from the age of 50 onwards. PSA is a protein produced by both the
malignant and the normal cells of the prostate gland [13]. For healthy men, PSA measurements in the
blood range from 2 to 4 ng/ml. However, the interpretation of PSA varies with age: a PSA value above
4 ng/ml combined with an age over 50 indicates a very high probability of cancer. The diagnostic value
of PSA decreases, however, for prostate cancer patients with a large prostate volume, because the
co-occurring enlargement reduces the effect of the cancer on PSA [14]. An enlarged prostate produces an
apparent symptom when it affects the urethra. The patient data were provided by Baghdad Medical City
hospital in April 2017.
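
The three features and the age/PSA rule of thumb described above can be written down as a short Python
sketch. This is only an illustration: the names `PatientRecord` and `elevated_risk` are hypothetical, the
prostate volume is stored as recorded because the text does not specify its format, and the rule is the
simple threshold stated in this part, not the WNN classifier itself.

```python
from typing import NamedTuple, Optional


class PatientRecord(NamedTuple):
    """One patient described by the three features used in this work."""
    age_years: int
    psa_ng_ml: float
    prostate_volume: Optional[str]  # kept as recorded; format is an assumption


def elevated_risk(record: PatientRecord) -> bool:
    # Rule of thumb from the text: PSA above 4 ng/ml together with age over 50.
    return record.age_years > 50 and record.psa_ng_ml > 4.0


low = PatientRecord(age_years=60, psa_ng_ml=1.06, prostate_volume="8*6")
high = PatientRecord(age_years=61, psa_ng_ml=8.28, prostate_volume=None)
print(elevated_risk(low), elevated_risk(high))
```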


3. WAVELET NEURAL NETWORKS (WNNs) 

Wavelet Analysis (WA) grew out of the Fourier Transform (FT) and the Windowed Fourier Transform
(WFT) and was developed to overcome the shortcomings of the FT. The Fourier transform is a common
method in digital signal processing. However, when applied to time series prediction, it shows
limitations in the time domain. WA is an analytical and mathematical tool with a wide range of research
applications, and it is currently used for time localization, time series analysis and intensity
analysis [15]. A wavelet is a special waveform with a finite duration and an average value of zero. WA
relies on a unique function called the mother wavelet. The wavelet basis is built from a series of
orthogonal basis sets composed of a father wavelet $\varphi(t)$ and a mother wavelet $\psi(t)$, which
satisfy:


$\int \varphi(t)\,dt = 1$ (1)


$\int \psi(t)\,dt = 0$ (2)
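
As a numerical illustration of conditions (1) and (2), the sketch below integrates a father and a mother
wavelet with the midpoint rule. The Haar pair is used purely as an assumption for the example (it is not
the wavelet used later in this paper), and the helper names are hypothetical.

```python
def haar_father(t):
    # Haar scaling (father) function: 1 on [0, 1), 0 elsewhere.
    return 1.0 if 0.0 <= t < 1.0 else 0.0


def haar_mother(t):
    # Haar mother wavelet: +1 on [0, 0.5), -1 on [0.5, 1), 0 elsewhere.
    if 0.0 <= t < 0.5:
        return 1.0
    if 0.5 <= t < 1.0:
        return -1.0
    return 0.0


def integrate(f, lo, hi, steps=3000):
    # Midpoint rule on a uniform grid.
    h = (hi - lo) / steps
    return sum(f(lo + (i + 0.5) * h) for i in range(steps)) * h


father_area = integrate(haar_father, -1.0, 2.0)  # condition (1): close to 1
mother_area = integrate(haar_mother, -1.0, 2.0)  # condition (2): close to 0
print(father_area, mother_area)
```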


The wavelet family also contains so-called child wavelets, which are translated and dilated forms
of the mother wavelet:


$\psi_{a,b}(t) = \frac{1}{\sqrt{a}}\, \psi\left(\frac{t - b}{a}\right)$ (3)
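
A child wavelet can be built from any mother wavelet by a small helper such as the one sketched below.
The function name `child_wavelet` is hypothetical, the $1/\sqrt{a}$ normalization is assumed, and the
Morlet function of Equation (17) is used as the mother wavelet only for illustration.

```python
import math


def morlet(t):
    # Morlet mother wavelet in the form of Equation (17).
    return math.cos(1.75 * t) * math.exp(-t * t / 2.0)


def child_wavelet(mother, a, b):
    """Return the dilated (a > 0) and translated (b) version of `mother`."""
    if a <= 0:
        raise ValueError("dilation a must be positive")
    return lambda t: mother((t - b) / a) / math.sqrt(a)


# A child wavelet stretched by a = 2 and shifted to b = 1.
psi_child = child_wavelet(morlet, a=2.0, b=1.0)
print(psi_child(1.0))  # peak of the child wavelet, located at t = b
```

Increasing `a` widens the waveform and lowers its peak, while `b` moves the peak along the t axis.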


4. WAVELET NEURAL NETWORK CONFIGURATION (WNNC) 

The WNN configuration is shown in Figure 1. The main structure of the WNN is as follows [16]:
1. WNN parameter initialization. The weight matrix between the input and the hidden layer is


$W = (W_{jk})_{p \times m}$


The weight matrix between the hidden and the output layer: $V = (V_{ij})_{n \times p}$


Dilation vector of the hidden-layer neurons: $a = (a_1, a_2, \ldots, a_p)$


Translation vector of the hidden-layer neurons: $b = (b_1, b_2, \ldots, b_p)$


where m, p and n denote the number of nodes in the input layer, hidden layer and output layer of the
WNN, respectively. The parameters are initialized arbitrarily.
Figure 1. Wavelet Neural Network configuration


2. Generate the output of the network by computing the forward pass. The output of the jth hidden node
is given by:


$\psi_{a,b}(net_j^{(1)}) = \psi\left(\frac{net_j^{(1)} - b_j}{a_j}\right)$ (4)


where:
$net_j^{(1)} = \sum_{k=1}^{m} W_{jk}\, x_k$ (5)
The output of the ith node at the output layer is expressed as:


$y_i = f\left(\sum_{j=1}^{p} V_{ij}\, \psi_{a,b}(net_j^{(1)})\right) = f(net_i^{(2)})$ (6)


where X is the WNN input vector, expressed as $X = (x_1, x_2, \ldots, x_m)$. Once X is given, the output
of the jth node of the hidden layer can be computed.
3. Compute the total output error by applying the following equation:


$E = \frac{1}{2} \sum_{q=1}^{Q} \sum_{i=1}^{n} (t_{qi} - y_{qi})^2$ (7)

where E is the total error and Q is the number of training samples. For each sample q,
$T_q = (t_{q1}, t_{q2}, \ldots, t_{qn})$ is the desired vector and $Y_q = (y_{q1}, y_{q2}, \ldots, y_{qn})$
is the output vector. The ability to reach the minimum total output error is considered a significant
advantage of the WNN.
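4. Find the partial derivatives of E with respect to each parameter, $\partial E / \partial W_{jk}$,
$\partial E / \partial V_{ij}$, $\partial E / \partial a_j$ and $\partial E / \partial b_j$:

$\frac{\partial E}{\partial V_{ij}} = -\sum_{q=1}^{Q} (t_{qi} - y_{qi})\, y_{qi}\, (1 - y_{qi})\, \psi_{a,b}(net_j^{(1)})$ (8)

$\frac{\partial E}{\partial W_{jk}} = -\sum_{q=1}^{Q} \sum_{i=1}^{n} (t_{qi} - y_{qi})\, y_{qi}\, (1 - y_{qi})\, V_{ij}\, \psi'_{a,b}(net_j^{(1)})\, \frac{x_k}{a_j}$ (9)

$\frac{\partial E}{\partial a_j} = \sum_{q=1}^{Q} \sum_{i=1}^{n} (t_{qi} - y_{qi})\, y_{qi}\, (1 - y_{qi})\, V_{ij}\, \psi'_{a,b}(net_j^{(1)})\, \frac{net_j^{(1)} - b_j}{a_j^2}$ (10)

$\frac{\partial E}{\partial b_j} = a_j^{-1} \sum_{q=1}^{Q} \sum_{i=1}^{n} \left[ \psi'_{a,b}(net_j^{(1)})\, (t_{qi} - y_{qi})\, y_{qi}\, (1 - y_{qi})\, V_{ij} \right]$ (11)


where $\psi'_{a,b}(net_j^{(1)}) = \psi'\left(\frac{net_j^{(1)} - b_j}{a_j}\right)$ is the derivative of the
mother wavelet evaluated at the argument used in Equation (4), and $net_j^{(1)}$ and $x_k$ are taken
for sample q.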


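5. Define the learning rate and momentum values as $\eta = 0.2$ and $\alpha = 0.99$ to update the WNN
parameters as shown in Equations (12)-(15) [15, 16]:


$W_{jk}(t+1) = W_{jk}(t) - \eta \frac{\partial E}{\partial W_{jk}} + \alpha\, [W_{jk}(t) - W_{jk}(t-1)]$ (12)
$V_{ij}(t+1) = V_{ij}(t) - \eta \frac{\partial E}{\partial V_{ij}} + \alpha\, [V_{ij}(t) - V_{ij}(t-1)]$ (13)
$a_j(t+1) = a_j(t) - \eta \frac{\partial E}{\partial a_j} + \alpha\, [a_j(t) - a_j(t-1)]$ (14)
$b_j(t+1) = b_j(t) - \eta \frac{\partial E}{\partial b_j} + \alpha\, [b_j(t) - b_j(t-1)]$ (15)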
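
Steps 2 to 5 above can be sketched in plain Python as follows. This is a minimal illustration under
stated assumptions, not the implementation used in this work: the function names are hypothetical, the
output activation f is taken to be the sigmoid, the hidden activation is the Morlet function of
Equation (17), and the gradients are obtained by applying the chain rule to Equations (4)-(7).

```python
import math

ETA, ALPHA = 0.2, 0.99  # learning rate and momentum quoted in step 5


def morlet(z):
    # Morlet mother wavelet in the form of Equation (17).
    return math.cos(1.75 * z) * math.exp(-z * z / 2.0)


def morlet_deriv(z):
    # Derivative of morlet() with respect to its argument z.
    return -(1.75 * math.sin(1.75 * z) + z * math.cos(1.75 * z)) * math.exp(-z * z / 2.0)


def sigmoid(s):
    return 1.0 / (1.0 + math.exp(-s))


def forward(x, W, V, a, b):
    # Equations (4)-(6) for a single input vector x.
    p = len(a)
    net = [sum(w * xk for w, xk in zip(W[j], x)) for j in range(p)]
    z = [(net[j] - b[j]) / a[j] for j in range(p)]
    h = [morlet(zj) for zj in z]
    y = [sigmoid(sum(v * hj for v, hj in zip(V[i], h))) for i in range(len(V))]
    return z, h, y


def total_error(X, T, W, V, a, b):
    # Equation (7): half the summed squared error over all samples and outputs.
    return 0.5 * sum(
        (ti - yi) ** 2
        for x, t in zip(X, T)
        for ti, yi in zip(t, forward(x, W, V, a, b)[2])
    )


def gradients(X, T, W, V, a, b):
    # Chain-rule gradients of E with respect to W, V, a and b (step 4).
    p, m, n = len(a), len(W[0]), len(V)
    gW = [[0.0] * m for _ in range(p)]
    gV = [[0.0] * p for _ in range(n)]
    ga, gb = [0.0] * p, [0.0] * p
    for x, t in zip(X, T):
        z, h, y = forward(x, W, V, a, b)
        delta = [(t[i] - y[i]) * y[i] * (1.0 - y[i]) for i in range(n)]
        for i in range(n):
            for j in range(p):
                gV[i][j] -= delta[i] * h[j]
        for j in range(p):
            back = sum(delta[i] * V[i][j] for i in range(n)) * morlet_deriv(z[j]) / a[j]
            for k in range(m):
                gW[j][k] -= back * x[k]
            ga[j] += back * z[j]
            gb[j] += back
    return gW, gV, ga, gb


def momentum_update(theta, theta_prev, grad, eta=ETA, alpha=ALPHA):
    # Equations (12)-(15) applied to one scalar parameter.
    return theta - eta * grad + alpha * (theta - theta_prev)
```

One training iteration calls `gradients` once and then applies `momentum_update` to every entry of W,
V, a and b, keeping the previous values for the momentum term. A quick way to gain confidence in such a
sketch is to compare each analytic gradient with a central finite difference of `total_error`.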


5. RESULTS AND DISCUSSION 

This part discusses the use of the WNN in the diagnosis of prostate cancer. Three features were
used to measure prostate cancer risk: patient age, PSA and prostate volume. In this article, the WNN is
applied as a new method; it consists of an input layer (m), a hidden layer (p) and an output layer (n).
The input layer, hidden layer and output layer consist of three nodes, seven nodes and one node
respectively, with the hidden-layer size given by the following equation [16].


$N_{hidden} \geq (2 \times N_{input}) + 1$ (16)


where $N_{hidden}$ and $N_{input}$ denote the number of neurons in the hidden and the input layer,
respectively. The Morlet function is used as the activation function in the hidden layer, and the
sigmoid function is used in the output layer [11]. Equation (17) below gives the Morlet function, which
is displayed in Figure 2. Table 1 lists the parameters applied in this work.


$\psi(t) = \cos(1.75\,t)\, \exp\left(-\frac{t^2}{2}\right)$ (17)


where t is the output of the summation function in the hidden layer.
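
The hidden-layer sizing rule and the Morlet activation can be checked with a few lines of Python. The
sketch assumes the usual form of the Morlet function, cos(1.75 t) exp(-t^2 / 2), and the helper names
`hidden_size` and `morlet` are hypothetical.

```python
import math


def hidden_size(n_input):
    # Smallest hidden-layer size allowed by Equation (16).
    return 2 * n_input + 1


def morlet(t):
    # Morlet activation used in the hidden layer, Equation (17).
    return math.cos(1.75 * t) * math.exp(-t * t / 2.0)


n_hidden = hidden_size(3)  # three inputs: age, PSA and prostate volume
samples = [(t, morlet(t)) for t in (-4.0, -2.0, 0.0, 2.0, 4.0)]
print(n_hidden)
for t, value in samples:
    print(t, round(value, 4))
```

The function is even and peaks at t = 0 with value 1, decaying quickly towards zero as |t| grows.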
Figure 2. Morlet wavelet function [11]
Table 1. Parameter settings

No.  Parameter's Name                       WNN              NN
1    Number of neurons in input layer       3                3
2    Number of neurons in hidden layer      7                7
3    Number of neurons in output layer      1                1
4    Activation function in hidden layer    Morlet           sigmoid
5    Activation function in output layer    sigmoid          tansig
6    Learning rate value (η)                0.2              0.5
7    Momentum value (α)                     0.99             0.99
8    Iteration number                       200 epochs       200 epochs
9    Training method                        backpropagation  backpropagation





Each network was trained for 200 epochs using the backpropagation (BP) algorithm. Once training is
over, the WNN is applied directly to the classification of prostate cancer risk. The performance of the
WNN can be assessed using the following indicators.


5.1. Convergence of Performances 

Figures 3 and 4 show the MSE of the WNN and the NN, respectively. The WNN clearly reaches a lower
MSE value than the NN, which indicates higher classification performance and therefore higher
performance in the diagnosis of prostate cancer. As a result, the WNN has a higher convergence rate
than the NN.
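
The text does not spell out how the MSE is computed, so the sketch below assumes the usual definition,
the mean of the squared differences between desired and actual outputs. The function name
`mean_squared_error` and the sample values are hypothetical.

```python
def mean_squared_error(targets, outputs):
    """Mean of squared differences between desired and actual outputs."""
    if len(targets) != len(outputs) or not targets:
        raise ValueError("targets and outputs must be non-empty and equally long")
    return sum((t - y) ** 2 for t, y in zip(targets, outputs)) / len(targets)


# Two illustrative samples: desired outputs 1 and 0, network outputs 0.9 and 0.2.
print(mean_squared_error([1.0, 0.0], [0.9, 0.2]))
```

Evaluating this quantity after every epoch gives a convergence curve of the kind compared in this part.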
5.2. Classification Experimental Results

Table 2 shows that the classification results of the WNN outperform those obtained by the NN on
several cases of prostate cancer patients. Furthermore, for the cases with an age above 50 and a PSA
over 4 ng/ml, the WNN gives higher classification outputs than the NN.


Table 2. Classification of WNN and NN

Name     Age  PSA (ng/ml)  Volume (cm)  Output WNN (%)  Output NN (%)  Case
Case 1   60   1.06         8*6          3.6             6.6            Normal
Case 2   84   0.39         10*7         30.61           45.61          Normal
Case 3   72   95.11        -            93.69           90.69          Abnormal
Case 4   60   161.9        -            92.6            82.6           Abnormal
Case 5   70   11.3         (7-15)       99.99           93.99          Abnormal
Case 6   85   54.9         -            99.69           92.69          Abnormal
Case 7   50   TA8          (10-13       24.6            24.6           Abnormal
Case 8   75   16.2         -            99.99           99.99          Abnormal
Case 9   67   14.11        -            99.88           96.88          Abnormal
Case 10  67   6.9          -            99.99           92.99          Abnormal
Case 11  716  100          -            92.6            90.6           Abnormal
Case 12  61   8.28         -            99.9            90.9           Abnormal
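
The comparison in Table 2 can be tallied with a short Python sketch. It assumes that each output is the
estimated cancer risk in percent, so that a higher output is better for an abnormal case and a lower
output is better for a normal case; this reading, and the helper name `compare`, are assumptions made
for the example. Only the label and the two output columns are used.

```python
# (case, label, WNN output %, NN output %) copied from Table 2.
rows = [
    ("Case 1", "normal", 3.6, 6.6),
    ("Case 2", "normal", 30.61, 45.61),
    ("Case 3", "abnormal", 93.69, 90.69),
    ("Case 4", "abnormal", 92.6, 82.6),
    ("Case 5", "abnormal", 99.99, 93.99),
    ("Case 6", "abnormal", 99.69, 92.69),
    ("Case 7", "abnormal", 24.6, 24.6),
    ("Case 8", "abnormal", 99.99, 99.99),
    ("Case 9", "abnormal", 99.88, 96.88),
    ("Case 10", "abnormal", 99.99, 92.99),
    ("Case 11", "abnormal", 92.6, 90.6),
    ("Case 12", "abnormal", 99.9, 90.9),
]


def compare(table):
    """Count the cases where the WNN output is better than, equal to, or worse than the NN output."""
    better = ties = worse = 0
    for _, label, wnn, nn in table:
        # Positive margin means the WNN output is closer to the true label.
        margin = wnn - nn if label == "abnormal" else nn - wnn
        if margin > 0:
            better += 1
        elif margin == 0:
            ties += 1
        else:
            worse += 1
    return better, ties, worse


print(compare(rows))  # (10, 2, 0)
```

Under this reading the WNN is strictly better in ten cases and ties with the NN in Cases 7 and 8.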
5.3. Computational complexity

Computational complexity is a significant factor in evaluating the performance of a network.
Table 3 shows the CPU time used on the same processor during training for each approach. The machine
has 3 GB of installed memory (RAM) and an Intel Core i3 M 370 CPU running at 2.40 GHz. The CPU time
results indicate that the WNN is faster than the NN. Furthermore, it has been found that the WNN
requires fewer operations than the NN method. Therefore, the WNN is less complex than the NN.


Table 3. CPU utilization time

Training Approach  WNN     NN
CPU time (sec)     10.190  18.22
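
The text does not state how the CPU time was measured. One way to take such a measurement in Python is
the standard-library timer `time.process_time`, as sketched below; the helper name `cpu_time_of` is
hypothetical. The last lines compute the ratio of the two values reported in Table 3.

```python
import time


def cpu_time_of(fn, *args):
    """Run fn(*args) and return (result, CPU seconds used by this process)."""
    start = time.process_time()
    result = fn(*args)
    return result, time.process_time() - start


# Stand-in workload; a real measurement would wrap the training routine.
result, elapsed = cpu_time_of(sum, range(1000))

# Ratio of the reported training times (NN over WNN), roughly 1.79.
speedup = 18.22 / 10.190
print(result, elapsed >= 0.0, round(speedup, 2))
```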











6. CONCLUSION 

In this article, a WNN is used for the diagnosis of prostate cancer. The Morlet function is used
as the activation function in the hidden layer, and the BP method is applied to train the network. The
simulation results show that the WNN performs much better than the neural network in the classification
of prostate cancer. The results show that the WNN converges faster than the NN and reaches a lower MSE
of 1.605e-10. In addition, the WNN gives better classification values and exhibits less computational
complexity than the NN. However, compared to the NN, the WNN needs more time to achieve its best MSE
value because it uses a lower learning rate to obtain a higher convergence rate.